NSF PAR Search | NSF Public Access Repository

Where The Wild Things Are: Brute-Force SSH Attacks In The Wild And How To Stop Them

Singh, Sachin; Gautam, Shreeman; Cartier, Cameron; Patil, Sameer; Ricci, Robert (April 2024, USENIX)

SSH (Secure Shell) is widely used for remote access to systems and cloud services. This access comes with the persistent threat of SSH password-guessing brute-force attacks (BFAs) directed at sshd-enabled devices connected to the Internet. In this work, we present a comprehensive study of such attacks on a production facility (CloudLab), offering previously unreported insight. Our study provides a detailed analysis of SSH BFAs occurring on the Internet today through an in-depth analysis of sshd logs collected over a period of four years from over 500 servers. We report several patterns in attacker behavior, present insight on the targets of the attacks, and devise a method for tracking individual attacks over time across sources. Leveraging our insight, we develop a defense mechanism against SSH BFAs that blocks 99.5% of such attacks, significantly outperforming the 66.1% coverage of current state-of-the-art rate-based blocking while also cutting false positives by 83%. We have deployed our defense in production on CloudLab, where it catches four-fifths of SSH BFAs missed by other defense strategies.
Full Text Available
Arvin: Greybox Fuzzing Using Approximate Dynamic CFG Analysis

https://doi.org/10.1145/3579856.3582813

Shahini, Sirus; Zhang, Mu; Payer, Mathias; Ricci, Robert (July 2023, Proceedings of the 2023 ACM Asia Conference on Computer and Communications Security)

Full Text Available
A Year of Automated Anomaly Detection in a Datacenter

Ahmed, Rufaida; Porter, Joseph; Abdelmutalab, Abubaker; Ricci, Robert (October 2020, Proceedings of the 2nd workshop on Machine Learning for Computing Systems (MLCS))
Anomaly detection based on Machine Learning can be a powerful tool for understanding the behavior of large, complex computer systems in the wild. The set of anomalies seen, however, can change over time: as the system evolves, is put to different uses, and encounters different workloads, both its ‘typical’ behavior and the anomalies that it encounters can change as well. This naturally raises two questions: how effective is automated anomaly detection in this setting, and how much does anomalous behavior change over time? In this paper, we examine these questions for a dataset taken from a system that manages the lifecycle of servers in datacenters. We look at logs from one year of operation of a datacenter of about 500 servers. Applying state-of-the-art techniques for finding anomalous events, we find that there is a ‘core’ set of anomaly patterns that persist over the entire period studied, but that, in order to track the evolution of the system, we must re-train the detector periodically. Working with the administrators of this system, we find that, despite these changes in patterns, the detected anomalies still contain actionable insights.
Full Text Available
In Datacenter Performance, The Only Constant Is Change

Duplyakin, Dmitry; Uta, Alexandru; Maricq, Aleksander; Ricci, Robert (May 2020, Proceedings of the Twentieth IEEE/ACM International Symposium on Cluster, Cloud and Internet Computing (CCGrid))

All computing infrastructure suffers from performance variability, be it bare-metal or virtualized. This phenomenon originates from many sources: some transient, such as noisy neighbors, and others more permanent but sudden, such as changes or wear in hardware, changes in the underlying hypervisor stack, or even undocumented interactions between the policies of the computing resource provider and the active workloads. Thus, performance measurements obtained on clouds, HPC facilities, and, more generally, datacenter environments are almost guaranteed to exhibit performance regimes that evolve over time, which leads to undesirable nonstationarities in application performance. In this paper, we present our analysis of performance of the bare-metal hardware available on the CloudLab testbed where we focus on quantifying the evolving performance regimes using changepoint detection. We describe our findings, backed by a dataset with nearly 6.9M benchmark results collected from over 1600 machines over a period of 2 years and 9 months. These findings yield a comprehensive characterization of real-world performance variability patterns in one computing facility, a methodology for studying such patterns on other infrastructures, and contribute to a better understanding of performance variability in general.
Full Text Available
On Studying CPU Performance of CloudLab Hardware

https://doi.org/10.1109/ICNP.2019.8888128

Duplyakin, Dmitry; Uta, Alexandru; Maricq, Aleksander; Ricci, Robert (October 2019, Proceedings of the Workshop on Midscale Education and Research Infrastructure and Tools (MERIT))

Empirical performance measurements of computer systems almost always exhibit variability and anomalies. Run-to-run and server-to-server variations are common for CPU, memory, disk, and network performance characteristics. In our previous work, we focused on taming performance variability for memory, disk, and network and established an interactive analysis service at: https://confirm.fyi/ to help users of the CloudLab testbed better plan and conduct their experiments. In this paper, we describe our analysis of CPU variability based on over 1.3M performance measurements from nearly 1,800 servers and present our initial findings. The focus of this work is on capturing hardware variability, which can make repeatable experiments more difficult and can impact conclusions; it is thus important for systems researchers to understand. (We note that, though we do not study it in this work, in the cloud, multi-tenancy and resource sharing can exacerbate the problem.) Variability also inevitably impacts performance and operation of middleware and high-level applications, contributing to the straggler problems in many domains, including HPC, Big Data, and Machine Learning, and on many types of cyberinfrastructures. We analyze the data from the CloudLab servers allocated in an exclusive fashion, with no virtualization. While our analysis focuses on the testbed that aims to promote reproducible research, we believe our approach and the findings can be of value to people who manage, analyze, and utilize shared computing resources in supercomputers, clouds, and datacenters.
Full Text Available
Mobile and wireless research on the POWDER platform

https://doi.org/10.1145/3458864.3466915

Breen, Joe; Duerig, Jonathon; Eide, Eric; Hibler, Mike; Johnson, David; Kasera, Sneha; Maas, Dustin; Orange, Alex; Patwari, Neal; Ricci, Robert; et al (June 2021, ACM MobiSys 2021)

Full Text Available
Powder: Platform for Open Wireless Data-driven Experimental Research

https://doi.org/10.1016/j.comnet.2021.108281

Breen, Joe; Buffmire, Andrew; Duerig, Jonathon; Dutt, Kevin; Eide, Eric; Ghosh, Anneswa; Hibler, Mike; Johnson, David; Kasera, Sneha Kumar; Lewis, Earl; et al (October 2021, Computer Networks)

Full Text Available
POWDER: Platform for Open Wireless Data-driven Experimental Research

https://doi.org/10.1145/3411276.3412204

Breen, Joe; Buffmire, Andrew; Duerig, Jonathon; Dutt, Kevin; Eide, Eric; Hibler, Mike; Johnson, David; Kasera, Sneha Kumar; Lewis, Earl; Maas, Dustin; et al (September 2020, WiNTECH 2020)

Full Text Available
Is Big Data Performance Reproducible in Modern Cloud Networks?

Uta, Alexandru; Custura, Alexandru; Duplyakin, Dmitry; Jimenez, Ivo; Rellermeyer, Jan; Maltzahn, Carlos; Ricci, Robert; Iosup, Alexandru (February 2020, Proceedings of the Seventeenth USENIX Symposium on Networked Systems Design and Implementation (NSDI))

Performance variability has been acknowledged as a problem for over a decade by cloud practitioners and performance engineers. Yet, our survey of top systems conferences reveals that the research community regularly disregards variability when running experiments in the cloud. Focusing on networks, we assess the impact of variability on cloud-based big-data workloads by gathering traces from mainstream commercial clouds and private research clouds. Our data collection consists of millions of datapoints gathered while transferring over 9 petabytes of data. We characterize the network variability present in our data and show that, even though commercial cloud providers implement mechanisms for quality-of-service enforcement, variability still occurs, and is even exacerbated by such mechanisms and service provider policies. We show how big-data workloads suffer from significant slowdowns and lack predictability and replicability, even when state-of-the-art experimentation techniques are used. We provide guidelines for practitioners to reduce the volatility of big data performance, making experiments more repeatable.
Free, publicly-accessible full text available February 1, 2030